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The feed-forward relationship naturally observed in time-dependent processes and in a diverse 
number of real systems -such as some food- webs and electronic and neural wiring- can be described 
in terms of so-called directed acyclic graphs (DAGs). An important ingredient of the analysis of 
such networks is a proper comparison of their observed architecture against an ensemble of random- 
ized graphs, thereby quantifying the randomness of the real systems with respect to suitable null 
models. This approximation is particularly relevant when the finite size and/or large connectivity 
of real systems make inadequate a comparison with the predictions obtained from the so-called 
configuration model. In this paper we analyze four methods of DAG randomization as defined 
by the desired combination of topological invariants (directed and undirected degree sequence and 
component distributions) aimed to be preserved. A highly ordered DAG, called snafce-graph and 
a Erd os-Renyi DAG were used to validate the performance of the algorithms. Finally, three real 
case studies, namely, the C. elegans cell lineage network, a PhD student-advisor network and the 
Milgram's citation network were analyzed using each randomization method. Results show how the 
interpretation of degree-degree relations in DAGs respect to their randomized ensembles depend on 
the topological invariants imposed. In general, real DAGs provide disordered values, lower than the 
expected by chance when the directedness of the links is not preserved in the randomization process. 
Conversely, if the direction of the links is conserved throughout the randomization process, disorder 
indicators are close to the obtained from the null-model ensemble, although some deviations are 
observed. 
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I. INTRODUCTION 

Many relevant properties of complex systems can be 
described by an appropriate network representation of 
their elements and interactions [iHl] . Most of these net- 
works are directed, i. e. there is a directional relation- 
ship between two elements defining who influences who 
in a given order. Among the class of directed networks, 
directed acyclic graphs -henceforth, DAGs- are an im- 
portant subset lacking feedback loops. This is specially 
suitable for the representation of evolutionary, develop- 
mental and historical processes in which the time asym- 
metry determines a feed-forward (acyclic) flow of causal 
relations. In this context, DAGs constitute a formal rep- 
resentation of causal relations that display the direct ef- 
fects of earlier events over latter ones. Citation networks 
are among their most paradigmatic cases [1, 0] ■ In these 
networks nodes are scientific articles and directed links 
(or arcs) stand for bibliographic citations among them. 
According to a chronological order, directed links are es- 
tablished from former articles to newer ones in a feed- 
forward manner. 

In general, time-dependent processes have been for- 
malized as DAGs. Examples of that comprehend arti- 
cle and patent citation networks Q, decision jurispru- 



dence processes [H, [Ti| and tree genealogies and phylo- 
genies. Moreover, other relevant systems such as stan- 
dard electric circuits fl^ . feed-forward neural jT3| and 
transmission networks |14l | are also suitably represented 
as DAGs. 

The main objective of this paper is to explore the ran- 
domness -in topological terms- of real systems displaying 
a directed acyclic structure by the defintion a collection 
of randomization methods that preserves a fixed number 
of topological invariants. To this end, the design of null 
models to highlight the particular features characterizing 
a_ system with respect to a neutral or random scenario 
is needed. In this context, the so-called configura- 
tion model 

[1 i, d, [ill has been probed as a fruitful 
approximation to provide a null-model scenario of what 
is expected by chance in complex networks under the 
assumptions of sparseness, infinite size and lack of cor- 
relations. However, little attention has been paid con- 
cerning DAGs. Indeed, a rigorous definition of random 
DAG from its directed degree sequence has been recently 
proposed [i3|, rising the interest for its study through 
configuration model approach. Borrowing the methodol- 
ogy to build random undirected graphs [11, , degree 
sequence is visualized as a set of edge stubs. Hence a 
random DAG is constructed by matching stubs accord- 
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ing to certain order constraints until they are completely 
canceled [13 • Without neglecting the important advance 
it represents for the comprehension of acyclic networks, 
some problems arise in using this methodology as the null 
model reference of real nets. Firstly, this methodology is 
dependent on how probable is to construct a graph from 
a degree sequence, since not all of them produce a graph, 
i.e., they are not graphical. Additionally, configuration 
model assumptions are not fulfilled in real systems due 
to their finite size and the presence of densely connected 
regions. 

An alternative approach used in this work is based 
on iterative processes of edge rewiring over the graph, 
keeping the graphical condition during all the process 
of randomization. This is a relevant issue since the de- 
gree sequence, either directed or undirected, imposes a 
particular space of topological configurations rather lim- 
ited -as we shall see in this work- for DAGs. Attending 
to this approach we can estimate where a real graph is 
placed preserving a graphical ensemble that holds some 
topological invariants. The two fundamental topological 
invariants considered in this work for a null model com- 
parison are the degree sequence (either directed or undi- 
rected) and the component structure. The both types of 
degree sequences and the degree distribution have been 
typically chosen as invariant in the random model con- 
struction 0, [l^ [13, [l^. However, as is well known in 
random graph theory, the existence of some graph satis- 
fying a given degree sequence does not guarantee a single 
connected component containing the whole set of nodes, 
except at high connectivities. Therefore sparse networks 
representing connected systems are expected to be frag- 
mented during a randomization process. This may be 
an undesirable effect when studying historical processes 
since it breaks the fiow of causality. Besides this problem, 
there are also real systems that display more than a single 
connected component. Those disconnected components 
do not interact among them and, arguably, can be consid- 
ered to be independent systems in terms of causality. It is 
worth to note that preservation of connected component 
in graph randomization processes has recently raised the 
interest of network community [20j . 

According to the above considerations, in order to pro- 
duce comparable ensembles for the evaluation of the ran- 
domness of a real DAG, we propose a collection of four 
randomization methods for DAGs that ensure the topo- 
logical invariants mentioned above. These randomiza- 
tion techniques were applied to two extreme -in terms of 
degree-degree relations- network models: an Erdos Renyi 
DAG and a highly ordered graph called snake-D AG. Our 
methodology was then applied to three real DAGs: a ci- 
tation network, a PhD-student advisor network and the 
cell lineage in the development of caebnorhabditis elegans 
worm. 

The paper is organized as follows: Section II offers the 
basic concepts related to DAGs. Section III explicitly de- 
fines the set of four randomization algorithms according 
to different topological invariants. Section IV describes 



and characterizes the randomness indicators and apply 
the randomization processes to the systems under study: 
two toy models -which enable us to validate the perfor- 
mance of the algorithms- and three real systems. Section 
V discusses the relevance of the obtained results. 



II. ORDERING, CAUSALITY AND FORMAL 
DEFINITION OF DAGS 



In this section we discuss some mathematical proper- 
ties of DAGs and their interpretation in terms of causal 
relations. We finally address the problem of component 
structure conservation. 



A. Basic definitions 

Let G{V, E) be a directed graph, being V = {vi, v^} 
the set of nodes, and the set of ordered pairs E = 
{{vk,Vi),...,{vj,vi)} the set of edges -where the order, 
{vk, Vi) implies that there is an arrow in the following di- 
rection: Vk Vi. The underlying graph Qu of a directed 
graph Q is an undirected graph with the same set of nodes 
G, but whose edges are undirected (i.e. {vk, Vj) G E, then 
{{vj^Vk), {vk,Vj)} e Eu). Given a node Vi S V, the num- 
ber of outgoing links, to be noted ko{vi), is called the 
out-degree of Vi. Similarly, the number of ingoing links of 
Vi is called in-degree of Vi, noted by ki{vi). 

A DAG is a directed graph characterized by 
the absence of cycles: If there is a directed path 
from Vi to Vk (i.e., there is a finite sequence 
{vi,Vj), {vj,vi), {vi,Vs), {vm,Vk) G E) then, there is no 
directed path from Vk to Vi. Borrowing concepts from 
order theory [2l|, , we refer to nodes with fc^ = as 
maximals and those with fco = as minimals. The ab- 
sence of cycles ensures that at least there is one minimal 
node and one maximal node. Maximal nodes can be seen 
as inputs of a given computational or sequential process 
while minimal -or terminal- ones are the outputs of such 
a process. Furthermore the acyclic nature permits to de- 
fine a node ordering by labeling all nodes with sequential 
natural numbers. Thus, in a DAG there is at least one 
numbering of the nodes such that: 



{y{v,,v,)eE)^{t>j). 



(1) 



For this reason, DAGs have been also referred as ordered 
graphs p7| . 



B. Random DAGs 



The theoretical roots of the concept of a random DAG 



are based on the so-called directed degree sequence [17 1 
-as well as the concept of random graph [l6j . A random 
DAG (y is a randomly chosen element of an ensemble of 
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DAGs which share the directed degree sequence, denoted 
by d{Q), which is defined as follows: 

diG) = ih{vi),ko{vi)),...,ih{vi),koiv,)),... (2) 

The two numerical quantities composing every element of 
such a sequence, ki{vk) and ko{vk), encode the pattern of 
connectivity of every node of the graph. In general, the 
ensemble of random graphs containing N nodes is com- 
posed by all possible graphs whose connectivity pattern 
satisfies the directed degree sequence. If we only pay 
attention to the number of edges connected to a given 
node Vi -regardless the direction of the arrows- we define 
the degree of the node Vi as k{vi) = ki{vi) -f fco(wi) [111 
and, consistently, the undirected degree sequence of Q, is 
defined as: 

du{Q) ^ k{vi),...,k{vi),... (3) 

However, it is clear that not any sequence of N pairs of 
natural numbers -or N natural numbers in the case of 
the undirected degree sequence- represents the degree se- 
quence of an ensemble of some kind of random graphs 
containing N nodes [IB, [13] • There are several restric- 
tions that a (un)-directed degree sequence must satisfy 
in order to represent a proper graph, i.e., a sequence to be 
graphical or feasible (la |. In the case of directed graphs, 
in and out degrees of the whole sequence must be consis- 
tent with the number of edges, i.e.: 

Y^k,{v,)^Y.^,{v{)^\E\, (4) 

2<n 'i<n 

It is clear that such a condition does not avoid the pres- 
ence of cycles in the network structure. Consistently with 
the claim that DAGs depict systems where some unavoid- 
able ordering among nodes is at work, we can ensure the 
generation of a given DAG if and only if there is a label- 
ing of the nodes such that Vi — >■ Vj implies that i > j ^29.] . 
Taking into account this ordering to build the graph, the 
directed degree sequence must also hold two conditions. 
First, 

ko{vi) = ki{vn) = 0, (5) 

and second, 

j<i j<i 

Under conditions (|4l5|6p it is ensured that a directed de- 
gree sequence will be graphical and able to represent the 
degree sequence of a given non-empty ensemble of DAGs. 

C. Component structure and causal relations 

The concept of component structure stems from 
the notion of undirected path: given two pairs of 
nodes Vi,Vk G V, there is an undirected path among 



them if there is a finite sequence of undirected edges 
such that it can be ordered sequentially, for example 
{vi,Vj},{vj,vi},{vi,Vs},...,{vm,Vk} e Eu- A component 
of is a (maximal) subset of V by which an undirected 
path can be defined among any pair of nodes. The spe- 
cial features of a DAG impose constraints on the number 
of -DAG like- components from a given directed degree 
sequence. Indeed, let M{G) ~ {vk G V : ki{vk) = 0} 
be the set of maximal nodes of a given DAG Q and 
P^(Q) = {vk G V : ko{vk) = 0} the set of its minimal 
nodes. Let d{Q) be the directed degree sequence of Q 
which, by assumption, is graphical. Then, the number of 
(DAG) components of the graph c{Q) is bounded by 

c(a) <min{|Af|,|/i|}, (7) 

since any connected DAG must have, at least, = 
\M\ — 1. Another constraint must be satisfied. There 
must exist a partition of the directed degree sequence 
d{Q) by which all c{Q) subsequences are graphical, i.e., 
they satisfy equations (|4|5|6p . 

III. RANDOMIZATION METHODS 

This section describes four methods to obtain random- 
ized ensambles of DAGs, preserving a number of invari- 
ants. The four algorithms presented below perform ran- 
dom rewirings according to constraints based on (un)- 
directed degree sequence and component structures, all 
of them explicitly avoiding the presence of multi-edges. 
Algorithms presented in this work are illustrated in fig- 
ure [TJ For the sake of clarity the order of presentation 
of the methods is the same for all figures and tables, and 
thus the letters used in fig. ^ inequivocaly identify the 
methods of randomization. 



A. Generating the ensemble from the undirected 
degree sequence 

The simplest method of randomization preserving 
components consists of applying a random numbering 
to Q"^ . This allows us to define an order criteria to es- 
tablish the direction of arrows. In this case, given an 
undirected pair {vi,Vj} we say that if i > j we defined 
the order pair as {vj,Vi), otherwise {vi,Vj). Since 5" 
is preserved, undirected degree-degree relations are also 
conserved. Then, a suitable randomization requires some 
additional process of link rewiring to destroy the presence 
of degree-degree relations. In this context, we provide a 
methodology that combines a rewiring process that pre- 
serves undirected degree sequence -see eq. 1^-, compo- 
nent structure and renumbering. 

This first randomization method, denoted by the letter 
a, is depicted in fig. ([1^). The steps of the algorithm are 
scheduled in the following: 

1. Given a DAG, G{V, E), we obtain its respective un- 
derlying network, Gu- 
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FIG. 1: Schematic representation of the four different DAG 
randomizations proposed in this work. Methods are alpha- 
betically denoted (a to d). Method a: randomization pre- 
serving undirected degree sequence and component size dis- 
tribution. Method b: DAG randomization only preserving 
the undirected degree sequence. Method c: randomization 
preserving directed degree sequence and component size dis- 
tribution. Method d: randomization preserving only directed 
degree sequence. 



2. We obtain a random network conserving the undi- 
rected degree sequence of and its compo- 
nent structure by a randomization process denom- 
inated local-swap [2(]j. Local-swap is performed 
as follows: We randomly select an existing edge 
{vi,Vj} of Gu such that the two additional edges 
{vi,Vk} and {vj,vi} also exist in provided that 
Vk,Vi,Vj,vi are all different. Then we proceed to 
make the rewiring, by generating the edges {vi, vi} 
and {vj,Vk}; and removing the edges {vi,Vk} and 
{vj, vi}. If {vi, vi} or {vj, Vk} already exist in Qu we 
abort the operation and we randomly select another 
edge satisfying the above described conditions to 
perform the local swap. According to [20], local- 
swap method can perform all rewirings of links ex- 
cept those that imply the breaking of the compo- 
nent structure. This process is iteratively repeated 
until achieving a suitable randomization of Gu or 
after a predefined number of iterations. 

3. Once the local-swap randomization of Qu is done, 
we label every node with an arbitrary natural num- 
ber, from 1 to N, being A'' the size of the graph. 
No repetitions are allowed. 

4. We now proceed to define the arrows taking into 
account the numbering of the nodes defined in the 



previous step. For every pair of connected nodes 
in the randomized version of Gu, we define the ar- 
row from lower to higher number's nodes. Formally, 
given a undirected pair {vi, Vj} where i, j are the re- 
spective labels obtained through the random num- 
bering, ii i > j then Vi Vj, otherwise Vi <— w^. 
The total order of natural numbers avoids the pres- 
ence of cycles. 

Method b consists of preserving the undirected degree 
sequence but not preserving component structure. Com- 
ponent structure is ensured by step 2) in method a. In 
this case, step 2) is replaced by a direct rewiring pro- 
cess: i) selecting a pair of different edges {vi,Vk}, {vj,vi} 
of Gu] ii) Generate with probability p — 1/2 either the 
edges {vi,vi},{vj,Vk} or the edges {vi,Vj),{vi,Vk} (pro- 
vided that both two edges are not already present) and 
Hi) remove the edges {wi,Wfc}, {vj,vi} -see fig. (Hb). 



B. Generating the ensemble from the directed 
degree sequence 

Beyond the randomization of the raw topological struc- 
ture of the real DAG conserving component structure, 
one could be interested in the preservation of the directed 
degree sequence -see eq. © . This has an important phys- 
ical interpretation, since it implies that every node has 
an invariant number of inputs and outputs, as it happens 
with the components of an electronic device. Under such 
a restriction we can no longer work with the underlying 
graph but with the directed graph. 

The proposed algorithm, denoted by method c -see 
fig- begins with a numbering of the nodes result- 

ing from the application of a leaf-removal algorithm (23j 
and a rewiring operation constrained by this number- 
ing. Let us briefly revise how a leaf-removal algorithm 
works: From the original graph, G, we iteratively remove 
the nodes with fco = until the complete pruning of the 
graph. According to this, a DAG can be layered, and thus 
a partial order between nodes can be easily established. 
Formally, the j-th iteration of the leaf-removal algorithm 
defines the set Vi C of nodes where Vi corresponds the 
i-th layer of the DAG. Then, any DAG can be redefined 
in terms of the resulting -ordered- layers of a leaf-removal 
algorithm, i.e.. 



W(G)^{Wi,...,Wi} 



(8) 



where no link between nodes of the same layer is estab- 
lished. 

Method c -see fig. ([TJ:)- is defined as follows: 

1. Generate the set W{G) by applying the leaf removal 
algorithm. 

2. Perform a random numbering of the nodes in such 
a way that, given Vi G Wu, and Vk G Ws, 



(u > s) {i > k) 



(9) 
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3. Select at random an edge {vk,Vj) € E. Then we 
look for the presence of two nodes Vi, vi £ V hy 
which either: 

i) {vi,Vk),{vi,Vj) £ E, or (10) 
m) {vk,Vi), {vj,vi) e E. (11) 

Notice that the absence of cycles makes these two 
options mutually exclusive. 

4. If the condition ([T0| is satisfied, the pairs (w;,Ufc) 
and {vi , Vj ) are generated and {vi ,Vk),{vi, Vj ) 
deleted, provided that the following conditions are 
satisfied: l){vi,Vk){vi,Vj) ^ E and 2)1 > k and 
i > j. If one of these two conditions does not hold, 
the rewiring event is aborted and another edge is 
newly selected at random. 

If condition ([TT|) is satisfied, the pairs {vk,vi) 
and {vj,Vi) are generated, deleting {vk,Vi, ){vj,vi) 
links, provided that {vk,vi){vj,Vi) ^ E and k > I 
and j > i conditions are satisfied. Again, if one 
of these two conditions does not hold the rewiring 
event is restarted. 

Finally, the randomization method d preserves the di- 
rected degree sequence but do not preserve the com- 
ponent strucutre. In this case, step 3) is replaced 
by the following procedure: i) select two edges at 
random {vk,Vi), {vi,Vj) £ E] ii) generate the edges 
(vkyVj), {vi,Vi) provided that {vk,Vj),{vuVi) ^ E and 
that k > j,l > i. If some of these conditions does 
not hold, process is aborted and we restart the rewiring 
event, in) If conditions are satisfied, {vk,Vi), {vi,Vj) are 
removed -see fig. ([TJl) . 

IV. EXPLORING THE RANDOMNESS OF 
DAGS 

In this section we apply the above defined algorithms 
to some real topologies to construct an ensemble of ran- 
domized networks (also known as surrogate data in other 
scientific communities) preserving the defined topological 
invariants. First of all, we need to define proper measures 
to evaluate the level of randomness of our systems. 

A. Testing the success of the randomization 
process 

As it is described above, randomizations are subject to 
very restrictive constraints since not all (un)-directed de- 
gree sequence configurations are graphical. Therefore, 
the success of DAG randomization processes must be 
properly evaluated. Two estimators were measured for 
this purpose. First, a dissimilarity parameter 2? is pro- 
posed to measure how the graph evolves along the itera- 
tions with respect to the original one. Second, the deteri- 
oration of present degree-degree relations is also reported 



by means of an estimator borrowed from information the- 
ory, the so-called joint entropy [24i] . 



1. Dissimilarity 

The dissimilarity parameter T) between two graphs is 
the relative frequency of link mismatches between them, 
i.e., the Hamming distance of their adjacency matrices. 
In the context of a randomization process, let us define A 
and A* as the adjacency matrices of an original graph {Q) 
and the graph resulting from the application of t random- 
ization iterations (^y*), respectively. Their dissimilarity 
can be expressed as 

T^iG,G')^J^^^^~S{A,,Al,), (12) 
i,j 

where 6 is the Kronecker's delta and \E\ denotes the num- 
ber of links of both Q and C?*, since the undirected degree 
sequence is preserved in the four randomization methods. 



2. Degree- degree joint entropy 

Given two random variables, X,Y, the joint entropy 
between X and Y, H{X,Y) is given by: 

HiX, Y) = -Y, y) log(P(2;, y), (13) 

being P(X, Y) the joint probability of the pair of out- 
comes X, y happening together -throughout this paper 
log2 will be used. Let us detail how every concept is 
translated in a useful way to become graph measures. 

Joint entropy for the evaluation of degree-degree rela- 
tions can be expressed as 

i?(eu) = 5] P(i,j)logP(*,j) (14) 

where P(i, j) defines the probability of finding a randomly 
selected link that connects two nodes Vm,vi G V such 
that k{vm) — i, k(vi) = j. This measurement was found 
to be more appropriate than other existing altervatives 
for the purpose of monitoring the de gree -degree interplay 
along the randomization processes [301 • The subscript 
"w" emphasizes that such a measure does not take into 
account the directed nature of the graph. Joint entropy 
quantifications for degree-degree considering the directed 
degree sequence can be easily derived. In this case three 
additional joint entropies attending dircctedness can be 
considered, namely the ones accounting for kiko, kiki and 
koko relations. Although more elaborated definitions of 
this probability can be proposed, for the sake of simplic- 
ity we assessed whether two nodes with given degrees 
tend to be connected, not matter the direction of the ar- 
row connecting them. Then the (fcifci)-joint entropy of a 
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directed graph Q, H'^'^{Q) is expressed as: 



random-DAG model 



w^\g)^Y.^,4k,j)\og¥.4k,j), (15) 

where Pi^i{k,j) is the probability of that a hnk chosen 
at random connects a node with ki — i to another with 
ki = j. A similar expression is obtained for H°'°{Q). 
Finally, H''°{g) is defined as: 



'(0) =5^P.,o(fc,j)logP.,o(fc,j)- (16) 



Notice that this is the only case where Fi^o{k,j) ^ 

F^,o(J,fc). 

The ensemble of random graphs produced from a orig- 
inal graph Q after t iterations can be associated to the 
undirected degree-degree joint entropy distribution of its 
conforming graphs, which can be characterized by its 
mean {H{Ql^)) and its standard deviation (j{H{Ql^)). The 
closeness of the joint entropy value of the original graph 
to the ensemble distribution can be quantified by means 
of the Z-score, which reads: 



The first one is a completely degree-degree disordered 
DAG, up to finite size effects. Let V be a set of N nodes, 
by which the probability for two of them to be connected 
is constant and equal to p. This is the definition of the 
Erdos Renyi (ER) graph. Once we have an ER graph, we 
randomly label the nodes of V sequentially, from 1 to N. 
Finally, we define the direction of the arrows by looking 
at the labeling of the nodes and observing condition ([T|) . 
We will refer to this model as random-DAG. 

We created a random-DAG of |y| = 600 and |£;| = 
6000 and, for each method, an ensemble of 500 random- 
ized graphs product of 2^^ iterations. See fig. ([2^) for an 
example of this graph. As shown in Table HI none of the 
degree-degree relations of the random-DAG where nei- 
ther significantly low nor significantly high with respect 
to any of its randomized ensembles. This result indicates 
that the four randomization methods proposed here do 
not produce significant undesirable biases in the degree- 
degree almost null relations of a originally random-DAG. 



2. snake-DAG model 



ZiGu) = 



<y{H{gi)) ■ 



(17) 



The statistical significance level was set at p < 0.001 
which for two tails corresponds to \Z\ > 3.27. Signifi- 
cant values were denoted by Z* in the tables describing 
joint entropy values of graphs. Values of Z < 3.27 means 
that the degree-degree relations at the original network 
Q are significantly high respect to the Z— distribution 
of its random ensemble. Values oi Z > 3.27 means 
that the degree-degree relations at the original network 
g are significantly low respect to the Z— distribution of 
its random ensemble. Finally, values within the range 
[—3.27, 3.27] indicate that no significant differences in the 
degree-degree relations were found between the original 
graph and its randomized ensemble. Analogously, we can 
compute (i?*'°(g*)), {H'''\g*)), {H°^°{g^)) and its associ- 
ated Z-scores at the step t of the randomization process. 



B. Extreme Graphs 

Prior to evaluate the randomness of real DAGs, we 
construct two extreme topologies in order to evaluate the 
behavior of the algorithms using the measures defined 
above. The first model, random-DAG, permit us to test 
the randomization methods in a highly disordered degree- 
degree scenario. In terms of degree-degree joint entropy, 
minimal changes are expected along the randomization 
processes. The second model, snafce-DAG, permit us the 
same test but in a highly ordered scenario where large 
increments of joint entropy values should be observed. 



Opposed to the random-DAG model, we construct a 
highly degree-degree ordered acyclic graph, which we will 
call snake-DAG see fig. In this graph, nodes of the 
same degree tend to be connected among them, giving 
rise to a high degree-degree relation and thus very low 
joint entropy values. In the following lines we outline the 
construction of this network. 

Let us consider K = fc™°^ as the highest outdegree to 
appear in the resulting graph. Let V be the set of nodes 
such that there exists an integer n by which n ■ K — \V\. 
We then perform a partition of V in K different subsets 



P{V)^{Vi,...,Vk}. 



(18) 



In this partition, for any Vi £ ^{V), \Vi\ — n. We se- 
quentially number the nodes of the set V is the following 
way: For the subset of nodes Vi , the label will run from 
1 to n, thus obtaining: 

Vi = {vi, ...,u„} 

For the subset of nodes V2 , the label will run from n + 1 
to 2n: 

V2 = ...,W2n}- 

We follow the numbering by using the criteria that the 
nodes of subset Vi will be labeled from [i — l)n -I- 1 to 
i ■ n, until all the nodes of V are numbered. We then 
identify the label of the partition with the out-degree of 
the nodes belonging to it, namely: 



(Vi e Vm) =^ iko{vi) = to). 



(19) 



Now we proceed to define the connections: For 
any Vi € Vm, we will have the following links 



FIG. 2: Illustration of a Random-D AG with N = 60 and (fc> = 3 (a). Illustration of a Snake-D AG with N = 60 and K ^ 6 
(b). In both graphs arrows go from nodes with larger numbers to nodes with smaller number. 



TABLE I: Joint entropy values for a Random-DAG of \V\ — 600 and \E\ — 6000 and a set of 500 randomized graphs after 2^* 
iterations of each of the four randomization methods (alphabetically denoted). Symbol (*) denotes significant differences. 



mc 



;tliod V H"(g'.) if'-°(g') H'''(g') H°-°(g*) 



G orig. - 8.279 9.214 9.163 9.250 

a 0.98 8.283 ± 0.003 (Z=-1.67) 9.21 ± 0.02 (Z=0.25) 9.20 ± 0.04 (Z=-0.95) 9.21 ± 0.04 (Z=1.01) 

b 0.98 8.283 ± 0.003 (Z=-1.46) 9.21 ± 0.03 (Z=0.21) 9.20 ± 0.04 (Z=-0.90) 9.21 ± 0.04 (Z=1.02) 

c 0.96 8.282 ± 0.003 (Z=-1.03) 9.209 ± 0.003 (Z=1.92) 9.157 ± 0.004 (Z=1.76) 9.249 ± 0.004 (Z=0.03) 

d 0.96 8.281 ± 0.003 (Z=-0.62) 9.206 ± 0.002 (Z=2.96) 9.152 ± 0.004 (Z=3.03) 9.247 ± 0.004 (Z=0.68) 



{vi,Vi-i), {vi,Vi-m) ■ This process excludes node vi 
which will only receive a link from V2. We observe that, 
in general, both vi and V2 belong to Vi . Finally, to break 
the extreme symmetry of the obtained net, we introduce 
a minimal source of noise by renumbering a small frac- 
tion ~ 0.05 of the nodes with a further arrow orientation 
consistently with the new numbering, as depicted in eq. 
©. 

Analogous to the experiment performed with a ran- 
dom- DAG, we create a snake-T)KG of \V\ — 600 and 
£'1 = 2099 {K = 6) and, for each method, an ensemble 
of 500 randomized graphs product of 2^^ iterations. See 
fig. for an example of this graph. As shown in Table 
im all the degree-degree relations of the snake-T)KG were 
significantly high with respect to any of the randomized 
ensembles. This result indicates that the four random- 
ization methods proposed here are able to successfully 
deteriorate the high degree-degree relations present at 
the snafce-DAG. 



C. Real biological and social DAGs 

Results of previous section have checked the behavior 
of the four methods in two toy models with high and 



low degree-degree relations respectively. In this section 
we proceed to evaluate three DAGs representing real sys- 
tems: the C. elegans cell lineage network, the Milgram's 
citation network and a PhD student-advisor network. 



1. C. elegans cell lineage network 

The first system chosen is a cell lineage network. 
Briefly, it captures the genealogic pedigree of cells related 
through mitotic division during its development in a tree- 
like structure. The cell lineage network of C. elegans was 
retrieved from the WormBase [3l| C. elegans repository. 
In this network the initial egg division (the giant compo- 
nent) and alternative variants of neural post-embrionic 
cell lines are included in a 18-component graph represen- 
tation. All the randomization methods were applied up 
to 2^* iterations. The dissimilarity values reached were 
over 0.90 in all cases, indicating a successful alteration of 
most of the links under the different topological invari- 
ants. 

Figure ([3]) shows the original and a representative DAG 
for each randomization method. Note the deep fragmen- 
tation produced by method b, where only the undirected 
degree sequence is preserved (205 components in fig. (jJb) 
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TABLE II: Joint entropy values for a snake-DAG {\V\ = 600 and K = 6) and a set of 500 randomized graphs after 2^** iterations 
of each of the four randomization methods (alphabetically denoted). Symbol (*) denotes significant differences. 



method 


















9 orig. 




2.998 


4.331 




4.131 




4.161 




a 


0.99 


5.281 ± 0.003 (Z=-691.88*) 


6.98 ± 0.02 (Z= 


-152.2*) 


6.97 ± 0.03 (Z= 


-91.53*) 


6.97 ± 0.03 (Z= 


-86.86*) 


b 


0.99 


5.281 ± 0.003 (Z=-671.59*) 


6.98 ± 0.02 (Z= 


-174.3*) 


6.97 ± 0.03 (Z= 


-91.83*) 


6.97 ± 0.03 (Z= 


-92.05*) 


c 


0.95 


4.96 ± 0.03 (Z=-58.88*) 


5.37 ± 0.02 (Z= 


-52.96*) 


5.50 ± 0.02 (Z= 


-63.42*) 


5.12 ± 0.02 (Z= 


-42.22*) 


d 


0.93 


4.71 ± 0.03 (Z=-51.63*) 


5.15 ± 0.02 (Z= 


-43.89*) 


5.22 ± 0.03 (Z= 


-41.95*) 


5.05 ± 0.02 (Z= 


-40.23*) 




FIG. 3: DAG representation of the cell lineage of Caehnorhahditis elegans (inbox). A prototypie randomized network after 2^* 
iterations is present for each method: randomization preserving undirected degree sequence and component size distribution 
(a), randomization only preserving the undirected degree sequence (b), randomization preserving directed degree sequence 
and component size distribution (c) and randomization preserving only directed degree sequence (d). Panel (e) represents the 
dissimilarity preserving the original network along the process of randomization for every randomization type. The mean and 
the standard deviation of 500 graph randomizations are shown for each point. 



TABLE III: Joint entropy values for the original C. elegans network and a set of 500 randomized networks after 2 iterations 
of each of the four randomization methods (alphabetically denoted). Symbol (*) denotes significant differences and (") denotes 
that Z-score is not computable due to cr = 



method 


D 














g orig. 




1.832 




0.991 


0.116 




1.732 


a 


0.99 


1.833 ± 0.001 (Z= 


-0.58) 


3.71 ± 0.01 (Z=-205.9*) 


3.67 ± 0.02 (Z= 


-167.4*) 


3.66 ± 0.02 (Z=-92.88* ) 


b 


1.00 


1.961 ± 0.001 (Z= 


-117.3*) 


3.71 ± 0.01 (Z=-224.5*) 


3.69 ± 0.02 (Z= 


-175.0*) 


3.68 ± 0.02 (Z=-92.55*) 


c 


0.90 


1.831 ± 0.002 (Z= 


-0.41) 


0.990 ± 0.001 (Z=1.33) 


0.116 ± 0.0° 




1.736 ± 0.002 (Z=-2.67) 


d 


0.99 


1.832 ± 0.002 (Z= 


=0.80) 


0.989 ± 0.001 (Z=-1.6) 


0.116 ± 0.0" 




1.734 ± 0.002 (Z=-1.67) 
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against 18-component in the original graph). Interest- 
ingly, figure ([3]d) shows that the tree-like structure and 
the number of graph components are conserved by just 
only preserving the directed degree sequence invariant. 
The reason is that the regular pattern of /c^ = 1 for 
all non-maximal nodes in its directed degree sequence 
is graphical only in a tree structure. In this particular 
situation the number of DAG components coincides with 
the number of maximal nodes. However, the size of com- 
ponents is not strictly conserved and this condition is 
only possible by applying the local-swap as observed in 
figures [5^ and[5J:. 

Looking at figure ([3]5), all randomizations provide com- 
pletely re-allocation of nodes. It is worth to note that 
methods not preserving the directed degree sequence 
start with a dissimilarity values of around 0.5. The rea- 
son is that a complete random arrow orientation occurs 
for every iteration. This contrasts with methods c and 
d where rewiring operated over the directed graph. Ta- 
ble (jllip shows that almost all the ensembles generated 
through the proposed algorithms display relevant devi- 
ations in the values of joint entropies. They are higher 
than the observed in the real DAG when the undirected 
degree sequence is preserved (methods a, b). Otherwise, 
when the directed degree sequence is conserved, no devia- 
tions are found (methods c, d) due to the very restrictive 
(even zero) standard deviations. 



2. Milgram's citation network 

The second system is a sample of the process of article 
citation. The chosen system used to illustrate this pro- 
cess is the resulting network containing the papers that 
cite "S Milgram's 1967 Psychology Today" paper or use 
Small World in the title. This network was retrieved 
from to Pajek's network dataset [s^]- All the random- 
ization methods were applied up to 2^^^ iterations (see 
figure HI). The dissimilarity values reached were over 0.86 
for methods a and b and over 0.76 for methods c and d. 
This indicates that keeping the directed degree sequence 
as a topological invariant reduces the heterogeneity of 
feasible graphs and thus the dissimilarity reached. 

Table (|IVp shows significantly low joint-entropy val- 
ues, indicating that this DAG displays a statistically rele- 
vant undirected degree-degree relation respect to all their 
randomized ensembles. It can also be appreciated rel- 
evant indegree-outdegree and outdegree-outdegree rela- 
tions with respect to all their randomized ensembles. In 
the case of indegree-indegree entropies, it shows a high 
degree-degree relation respect to the randomized ensem- 
bles produced by methods a and b, while there were 
no differences respect to the randomized ensembles pro- 
duced by methods c and d. This result indicates that 
indegree-indegree relations in the Milgram's citation net- 
work are high respect to the random DAGs that preserve 
only the undirected degree sequence and not differen- 
tiable from the ones obtained by preserving the directed 



degree sequence. 

This example illustrates how a randomization process 
destroys local associations and the heterogeneous parti- 
tion observed in the original DAG (figure|3]inbox) . In this 
case, due to the high connectivity of the original DAG, -it 
is worth to note that such a graph contains several nodes 
whose connectivity is OdT^I)- fragmentation is unlike to 
happen due to a high average connectivity. Furthermore, 
as a side effect, an upper boundary below maximal value 
of dissimilarity is imposed depending on the randomiza- 
tion method used. This is due to a considerable frac- 
tion of failed rewiring attempts. An example of that is 
provided by a clique conformation where no rewiring is 
possible. In this case no effective of rewiring can be done 
since all possible link combinations satisfying the directed 
acyclic condition are actually in the network. 



3. PhD student-advisor network 

The last system evaluated in this paper contains the 
ties between PhD students and their advisors in theoret- 
ical computer science. Each arc points from an advisor 
to a student. Data was retrieved from to Pajek's network 
dataset [11] ■ This network illustrates just the intermedi- 
ate situation between the two previous examples. It is 
a DAG able to be fragmented (when DAG components 
conservation is not imposed) but with just right connec- 
tivity: too low to avoid fragmentation but not too high to 
impose an upper bounding in dissimilarity, being, jointly 
to the ranrfom-DAG studied above, the DAG structure 
closer to the assumptions of the configuration model. In- 
terestingly, contrasting with the C. elegans case, network 
fragmentation also occurs when preserving directed de- 
gree sequence but not when the component distribution 
conservation is preserved. All the randomization meth- 
ods were applied up to 2^^ iterations. The dissimilarity 
values reached where over 0.97 in all cases, indicating a 
successful alteration of most of the links under the dif- 
ferent topological invariants. 

Table [5] displays statistically significant low joint- 
entropy values, indicating that this DAG has relevant 
undirected degree-degree relations with respect to all 
their randomized ensembles. It also displays a statisti- 
cally relevant indegree-indegree and outdegree-outdegree 
relations with respect to all their randomized ensembles. 
In the case of indegree-outdegree, it shows significant re- 
lations with respect to the randomized ensembles pro- 
duced by methods a and b, while there were no differ- 
ences respect to the randomized ensembles produced by 
methods c and d. This result indicates that indegree- 
outdegree relations in the PhD student-advisor network 
are high with respect to the random DAGs that preserve 
only the undirected degree sequence and are not differ- 
entiable from the ones obtained by random DAGs that 
also preserve the directed degree sequence. 
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FIG. 4: DAG representation of the Milgram's citation network (inbox). A prototypic randomized network after 2^® iterations is 
present for each method: randomization preserving undirected degree sequence and component size distribution (a), random- 
ization only preserving the undirected degree sequence (b) , randomization preserving directed degree sequence and component 
size distribution (c) and randomization preserving only directed degree sequence (d). Panel (e) represents the dissimilarity 
respecting the original network along the process of randomization for every randomization type. The mean and the standard 
deviation of 500 graph randomizations are shown for each point. 



TABLE IV: Joint entropy values for the original Milgram's citation network and a set of 500 randomized networks after 2 
iterations of each of the four randomization methods (alphabetically denoted). Symbol (*) denotes significant differences. 



method -D g"(g*) g'-°(g*) H'-'(g*) H'-'ig*) 

g orig. - 9.03 7.38 7.16 7.53 

a 0.87 9.24± 0.02 (Z=-10.5*) 8.2 ± 0.1 (Z=-8.6*) 8.2 ± 0.1 (Z=-7.8*) 8.20 ± 0.1 (Z=-4.8*) 

b 0.87 9.24 ± 0.02 (Z=-10.5*) 8.2 ± 0.1 (Z=-8.6*) 8.2 ± 0.1 (Z=-7.6*) 8.22 ± 0.1 (Z=-5.1*) 

c 0.70 9.18 ± 0.02 (Z=-7.35*) 7.45 ± 0.01 (Z=-7.38*) 7.17 ± 0.01 (Z=-1.42) 7.63 ± 0.01 (Z=-8.59*) 

d 0.70 9.18 ± 0.02 (Z=-7.93*) 7.45 ± 0.01 (Z=-7.34*) 7.17 ± 0.01 (Z=-1.40) 7.63 ± 0.01 (Z=-8.63*) 



TABLE V: Joint entropy values for the PhD student-advisor network a set of 500 randomized networks after 2 iterations of 
each of the four randomization methods (alphabetically denoted). Symbol (*) denotes significant differences. 



method V g"(g* ) g'-°(S*) g'-'(g') g°-°(g*) 

g orig. - 6.42 4.075 1.348 6.34 

a 0.99 6.47 ± 0.01 (Z=-4.50) 5.67 ± 0.06 (Z=-26. 06*) 5.58 ± 0.1 (Z=-36.07*) 5.56 ± 0.1 (Z=6.70*) 

b 0.99 6.75 ± 0.01 (Z=-29.62*) 5.69 ± 0.06 (Z=-28.68*) 5.63 ± 0.1 (Z=-43.18*) 5.62 ± 0.1 (Z=7.46*) 

c 0.97 6.47 ± 0.01 (Z=-5.34) 4.076 ± 0.005 (Z=-0. 32) 1.31 ± 0.01 (Z=3.33*) 6.39 ± 0.01 (Z=-4.53* ) 

d 0.98 6.59 ± 0.01 (Z=-13.55*) 4.072 ± 0.005 (Z=0.63) 1.366 ± 0.002 (Z=-8.27*) 6.39 ± 0.01 (Z=-5.13*) 
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FIG. 5: DAG representation of PhD advisors (inbox). A prototypic randomized network after 2^* iterations is present for each 
method: randomization preserving undirected degree sequence and component size distribution (a), randomization only pre- 
serving the undirected degree sequence (b), randomization preserving directed degree sequence and component size distribution 
(c) and randomization preserving only directed degree sequence (d). Panel e represents the dissimilarity respecting the original 
network along the process of randomization for every randomization type. The mean and the standard deviation of 500 graph 
randomizations are shown for each point. 



V. DISCUSSION 

In this paper we present a set of four algorithms based 
on an iterative process of rewiring for the construction 
of DAG random models. The difference between algo- 
rithms stems from two topological invariants under con- 
sideration, namely, the conservation of the directed de- 
gree sequence and/or the conservation of the connected 
component distribution. In contrast to other methods of 
random model construction, this approach works within 
the space of graphical solutions providing a feasible com- 
putational approximation for the exploration of such a 
graphical space considering a defined number of topolog- 
ical invariants in the null-model ensemble generation. 

Our methodology was evaluated through the analyses 
of both extreme and real graphs comparing them with 
their associated randomized ensembles using two mea- 
sures: dissimilarity and joint entropy. While the former 
indicates whether connections are actually changed af- 
ter randomization, the latter quantifies the disorder or 
uncertainty in the degree-degree relations, thereby being 
an indicator of randomness. In this context, it is worth 



to mention that other measures such assortative mixing 
[25! . [26j or mutual information [U have been suggested 
for the evaluation of degree-degree relations. In essence, 
these measures compare the actual degree correlations re- 
lation in the graph with the expected one obtained from 
the remaining degree information. A problem arises when 
a proper definition of remaining degree attending direct- 
edness needs of the information of the directed degree 
sequence because of the latter is a topological feature 
not preserved in all of our methods (methods a and b). 
Therefore, measures based on remaining degree informa- 
tion, although extensively used as estimators of degree- 
degree relations in the network literature [25l - l27j cannot 
be applied in this work for a comparative evaluation of 
our methodology. 

To overcome these limitations, joint entropy was used 
as a raw measure of uncertainty once defined to be ap- 
plied to directed graphs leading to four alternative de- 
scriptors according to in and out-degree information. 
Furthermore, the significance of the variation of degree- 
degree relations between the random ensembles and the 
original graph was evaluated using a Z-score estimator. 
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The analysis of network models verified that our meth- 
ods do not produce a bias when applied to the random- 
DAG model whilst they produced a significative increase 
of disorder of the degree-degree relations on the snake- 
DAG model when randomized -see table|lll Going to real 
systems, our analyses revealed that all the methods pro- 
duced an greater than its respective original value, 
suggesting that randomizations disorder the underlying 
graph and they do not only affect the pattern of arrows. 
However, values of undirected joint entropies are differ- 
ent among methods, suggesting that conservation of the 
DAG condition and the remaining topological invariants 
have a variable impact on the underlying network. When 
we look at the directed joint entropies a general increase 
of the values was observed for all of methods, although 
some exceptions are observed. This is the case of PhD 
student-advisor network where the original network ex- 
ibits a diversity of degree-degree relations higher than the 
randomized ensemble. 

Additionaly, our results show that preserving the com- 
ponent size structure is an important aspect to take into 
account since it has drammatic effects when the network 
is markedly sparse. This is the case of C. elegans and 
PhD student-advisor DAGs by which randomizations not 
preserving the component size produced a graph frag- 
mentation. On the contrary, high average degree guar- 
antees the preservation of the giant component and ran- 
domization methods. In such circumstances, methods a 
and b give comparable values of joint entropies. Anal- 
ogously, this is also sobserved for method c and d (see 
joint entropy for network models and also for the Mil- 
gram's citation network). This feature indicates that the 
conservation of component structure premise is not rele- 
vant and produce indistinguisable topologies when graph 
fragmentation is unlike to happen. 

Another important observation is related to the small 
values displayed by standard deviations in joint entropies. 
For methods c and d are one order of magnitud lower 
than the ones obtained for methods a and b . It suggests 
that just directed degree sequence conservation is enough 
to severely reduce the space of graphical configurations. 
Consistenly, it was observed that, in general, methods 
c and d provided lower Z-values than a and b. How- 
ever, the small divergence of the obtained values is not 
explained by a non-effective rewiring since high values 
of dissimilarity were reached. An interesting exception 
was found in the Milgram's citation network where dis- 
similarity values after processes of randomization were 
markedly lower than the observed in the other real net- 
works, as well as in the toy models. An explanation can 
be found in the presence of superhubs, nodes whose con- 
nectivity is ~ I). This introduces a strong constraint 
in the rewiring, difficult -even impossible- to overcome. 
Nevertheless figure ^ illustrates that the original net- 
work seems to be partitioned in two regions. Using the 
same layout for randomized graphs, we observed that 
such a partition was lost, suggesting that rewiring pro- 
cess was accounted. Contrasting to this behaviour, C. el- 



egans randomized ensembles were completely suffled -as 
indicated by the high values of dissimilarity- but degree- 
degree relations were not always significatively altered. 
This is specially evident when directed degree sequence 
is conserved. An explanation can be obtained by the fact 
that this DAG is practically a dichotomic tree. This net- 
work is sparse enough to be fragmented as it happened in 
method b. Nevertheless, preserving its extremal directed 
degree sequence was enough to conserve the number of 
components (not their size though). This is in aggree- 
ment to the constraint in the number of DAG compo- 
nents described in eq. ([7]). In fact, this is a result of the 
limited space of possibilities permited by the extremal 
directed degree sequence and therefore very little varia- 
tion is found in the joint entropies (notice the case of zero 
i/*'* for method c). Interestingly, when directed degree 
sequence and component structure are not preserved, tree 
configuration is unlikely to happen by chance. However, 
tree structure is practically the only solution when di- 
rected degree sequence is preserved even not conserving 
the component size distribution. 

The choice of topological constraints (i.e. the particu- 
lar method ) for a desired randomization process depends 
upon the question the researcher wants to explore, rather 
than upon a technical issue. Preserving the directed de- 
gree sequence captures the need to fix the number of 
inputs and outputs for every element. Randomizations 
attending to this constraint (for example, in a techno- 
logical system) may be interpreted as a rewiring of an 
electronic circuit by a random assembling of integrated 
devices (e. g. chips) but respecting the inputs and out- 
puts of the components. This contrasts with the softer 
undirected degree sequence invariant produced by pre- 
serving just the number of connections in every node. 
In this case, the relevance relies on the number of re- 
lations instead of mattering the arrows orientation -i.e., 
the undirected degree sequence. Furthermore, the con- 
servation of the connected components is essential in a 
graph describing a process, since fragmentation can be 
intepreted as a break of the flow of causality. 

Finally, we stress that an important feature of any ran- 
domization process is that topological invariants restrict 
the space of graphical solutions. Our methodology pro- 
vides valuable information about the randomness of a 
particular structure within the context of its graphical 
space of solutions. It is arguable to think that the higher 
the number of constraints the smaller the space of solu- 
tions. In any case, its complete exploration is not feasible 
beyond a graph containing more than a handful of nodes. 
In this context, our methodology provides a sampling of 
such space in order to estimate the randomness of a DAG 
given some topological contraints 
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system. The former case strictly looks for linear relation- 
ships and it is supposed to be a more appropriate mea- 
sure for normally distributed data. Furthermore both ap- 
proaches naturally require of certain degree-degree vari- 
ance within the graph. For instance, a large feed-forward 
single chain of nodes has a strong degree-degree deter- 
minism that none of these two measurements would cap- 
ture. The reason is that most of the degree-degree pairs 
would be (2, 2) for undirected and (1, 1) for any directed 
degree analyses. In this sense, degree-degree joint entropy 
provides a suitable measure of the relation or determin- 
ism of degree-degree relations with neither parametric 
assumptions nor degree-degree variance requisites. Ac- 
cordingly, we used the concept of degree-degree relations 
instead of degree-degree correlations. 
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